MEDIA: a semantically annotated corpus of task oriented dialogs in French
Identifieur interne : 003930 ( Main/Exploration ); précédent : 003929; suivant : 003931MEDIA: a semantically annotated corpus of task oriented dialogs in French
Auteurs : Hélène Bonneau-Maynard [France] ; Matthieu Quignard [France] ; Alexandre Denis [France]Source :
- Language Resources and Evaluation [ 1574-020X ] ; 2009-12-01.
Descripteurs français
- Pascal (Inist)
English descriptors
- KwdEn :
- mix :
Abstract
Abstract: The aim of the French Media project was to define a protocol for the evaluation of speech understanding modules for dialog systems. Accordingly, a corpus of 1,257 real spoken dialogs related to hotel reservation and tourist information was recorded, transcribed and semantically annotated, and a semantic attribute-value representation was defined in which each conceptual relationship was represented by the names of the attributes. Two semantic annotation levels are distinguished in this approach. At the first level, each utterance is considered separately and the annotation represents the meaning of the statement without taking into account the dialog context. The second level of annotation then corresponds to the interpretation of the meaning of the statement by taking into account the dialog context; in this way a semantic representation of the dialog context is defined. This paper discusses the data collection, the detailed definition of both annotation levels, and the annotation scheme. Then the paper comments on both evaluation campaigns which were carried out during the project and discusses some results.
Url:
DOI: 10.1007/s10579-009-9103-2
Affiliations:
Links toward previous steps (curation, corpus...)
- to stream Istex, to step Corpus: 001701
- to stream Istex, to step Curation: 001682
- to stream Istex, to step Checkpoint: 000A37
- to stream Hal, to step Corpus: 003005
- to stream Hal, to step Curation: 003005
- to stream Hal, to step Checkpoint: 002F30
- to stream Main, to step Merge: 003A08
- to stream PascalFrancis, to step Corpus: 000255
- to stream PascalFrancis, to step Curation: 000772
- to stream PascalFrancis, to step Checkpoint: 000238
- to stream Main, to step Merge: 003C72
- to stream Main, to step Curation: 003930
Le document en format XML
<record><TEI wicri:istexFullTextTei="biblStruct"><teiHeader><fileDesc><titleStmt><title xml:lang="en">MEDIA: a semantically annotated corpus of task oriented dialogs in French</title>
<author><name sortKey="Bonneau Maynard, Helene" sort="Bonneau Maynard, Helene" uniqKey="Bonneau Maynard H" first="Hélène" last="Bonneau-Maynard">Hélène Bonneau-Maynard</name>
</author>
<author><name sortKey="Quignard, Matthieu" sort="Quignard, Matthieu" uniqKey="Quignard M" first="Matthieu" last="Quignard">Matthieu Quignard</name>
</author>
<author><name sortKey="Denis, Alexandre" sort="Denis, Alexandre" uniqKey="Denis A" first="Alexandre" last="Denis">Alexandre Denis</name>
</author>
</titleStmt>
<publicationStmt><idno type="wicri:source">ISTEX</idno>
<idno type="RBID">ISTEX:62FE38FFD0D92679441FEF1AFB36859AC3BC3A98</idno>
<date when="2009" year="2009">2009</date>
<idno type="doi">10.1007/s10579-009-9103-2</idno>
<idno type="url">https://api.istex.fr/ark:/67375/VQC-VRS28KX7-6/fulltext.pdf</idno>
<idno type="wicri:Area/Istex/Corpus">001701</idno>
<idno type="wicri:explorRef" wicri:stream="Istex" wicri:step="Corpus" wicri:corpus="ISTEX">001701</idno>
<idno type="wicri:Area/Istex/Curation">001682</idno>
<idno type="wicri:Area/Istex/Checkpoint">000A37</idno>
<idno type="wicri:explorRef" wicri:stream="Istex" wicri:step="Checkpoint">000A37</idno>
<idno type="wicri:doubleKey">1574-020X:2009:Bonneau Maynard H:media:a:semantically</idno>
<idno type="wicri:source">HAL</idno>
<idno type="RBID">Hal:inria-00424619</idno>
<idno type="url">https://hal.inria.fr/inria-00424619</idno>
<idno type="wicri:Area/Hal/Corpus">003005</idno>
<idno type="wicri:Area/Hal/Curation">003005</idno>
<idno type="wicri:Area/Hal/Checkpoint">002F30</idno>
<idno type="wicri:explorRef" wicri:stream="Hal" wicri:step="Checkpoint">002F30</idno>
<idno type="wicri:doubleKey">1574-020X:2009:Bonneau Maynard H:media:a:semantically</idno>
<idno type="wicri:Area/Main/Merge">003A08</idno>
<idno type="wicri:source">INIST</idno>
<idno type="RBID">Francis:10-0023530</idno>
<idno type="wicri:Area/PascalFrancis/Corpus">000255</idno>
<idno type="wicri:Area/PascalFrancis/Curation">000772</idno>
<idno type="wicri:Area/PascalFrancis/Checkpoint">000238</idno>
<idno type="wicri:explorRef" wicri:stream="PascalFrancis" wicri:step="Checkpoint">000238</idno>
<idno type="wicri:doubleKey">1574-020X:2009:Bonneau Maynard H:media:a:semantically</idno>
<idno type="wicri:Area/Main/Merge">003C72</idno>
<idno type="wicri:Area/Main/Curation">003930</idno>
<idno type="wicri:Area/Main/Exploration">003930</idno>
</publicationStmt>
<sourceDesc><biblStruct><analytic><title level="a" type="main" xml:lang="en">MEDIA: a semantically annotated corpus of task oriented dialogs in French</title>
<author><name sortKey="Bonneau Maynard, Helene" sort="Bonneau Maynard, Helene" uniqKey="Bonneau Maynard H" first="Hélène" last="Bonneau-Maynard">Hélène Bonneau-Maynard</name>
<affiliation wicri:level="3"><country xml:lang="fr">France</country>
<wicri:regionArea>LIMSI–CNRS, Université Paris-Sud 11, Bât. 508, BP 133, 91403, Orsay Cedex</wicri:regionArea>
<placeName><region type="region" nuts="2">Île-de-France</region>
<settlement type="city">Orsay</settlement>
</placeName>
</affiliation>
<affiliation wicri:level="1"><country wicri:rule="url">France</country>
</affiliation>
</author>
<author><name sortKey="Quignard, Matthieu" sort="Quignard, Matthieu" uniqKey="Quignard M" first="Matthieu" last="Quignard">Matthieu Quignard</name>
<affiliation wicri:level="3"><country xml:lang="fr">France</country>
<wicri:regionArea>LORIA, Campus Scientifique, BP 239, 54506, Vandoeuvre-lès-Nancy Cedex</wicri:regionArea>
<placeName><region type="region" nuts="2">Grand Est</region>
<region type="old region" nuts="2">Lorraine (région)</region>
<settlement type="city">Vandœuvre-lès-Nancy</settlement>
</placeName>
</affiliation>
<affiliation wicri:level="1"><country wicri:rule="url">France</country>
</affiliation>
</author>
<author><name sortKey="Denis, Alexandre" sort="Denis, Alexandre" uniqKey="Denis A" first="Alexandre" last="Denis">Alexandre Denis</name>
<affiliation wicri:level="3"><country xml:lang="fr">France</country>
<wicri:regionArea>LORIA, Campus Scientifique, BP 239, 54506, Vandoeuvre-lès-Nancy Cedex</wicri:regionArea>
<placeName><region type="region" nuts="2">Grand Est</region>
<region type="old region" nuts="2">Lorraine (région)</region>
<settlement type="city">Vandœuvre-lès-Nancy</settlement>
</placeName>
</affiliation>
<affiliation wicri:level="1"><country wicri:rule="url">France</country>
</affiliation>
</author>
</analytic>
<monogr></monogr>
<series><title level="j">Language Resources and Evaluation</title>
<title level="j" type="abbrev">Lang Resources & Evaluation</title>
<idno type="ISSN">1574-020X</idno>
<idno type="eISSN">1574-0218</idno>
<imprint><publisher>Springer Netherlands</publisher>
<pubPlace>Dordrecht</pubPlace>
<date type="published" when="2009-12-01">2009-12-01</date>
<biblScope unit="volume">43</biblScope>
<biblScope unit="issue">4</biblScope>
<biblScope unit="page" from="329">329</biblScope>
<biblScope unit="page" to="354">354</biblScope>
</imprint>
<idno type="ISSN">1574-020X</idno>
</series>
</biblStruct>
</sourceDesc>
<seriesStmt><idno type="ISSN">1574-020X</idno>
</seriesStmt>
</fileDesc>
<profileDesc><textClass><keywords scheme="KwdEn" xml:lang="en"><term>Annotation</term>
<term>Assessment</term>
<term>Computational linguistics</term>
<term>Corpus</term>
<term>Corpus annotation</term>
<term>Dialog system</term>
<term>Evaluation</term>
<term>French</term>
<term>Speech processing</term>
<term>Speech understanding</term>
</keywords>
<keywords scheme="Pascal" xml:lang="fr"><term>Annotation de corpus</term>
<term>Evaluation</term>
<term>Français</term>
<term>Linguistique informatique</term>
<term>Traitement automatique de la parole</term>
</keywords>
<keywords scheme="mix" xml:lang="en"><term>Annotation</term>
<term>Corpus</term>
<term>Dialog system</term>
<term>Evaluation</term>
<term>Speech understanding</term>
</keywords>
</textClass>
<langUsage><language ident="en">en</language>
</langUsage>
</profileDesc>
</teiHeader>
<front><div type="abstract" xml:lang="en">Abstract: The aim of the French Media project was to define a protocol for the evaluation of speech understanding modules for dialog systems. Accordingly, a corpus of 1,257 real spoken dialogs related to hotel reservation and tourist information was recorded, transcribed and semantically annotated, and a semantic attribute-value representation was defined in which each conceptual relationship was represented by the names of the attributes. Two semantic annotation levels are distinguished in this approach. At the first level, each utterance is considered separately and the annotation represents the meaning of the statement without taking into account the dialog context. The second level of annotation then corresponds to the interpretation of the meaning of the statement by taking into account the dialog context; in this way a semantic representation of the dialog context is defined. This paper discusses the data collection, the detailed definition of both annotation levels, and the annotation scheme. Then the paper comments on both evaluation campaigns which were carried out during the project and discusses some results.</div>
</front>
</TEI>
<affiliations><list><country><li>France</li>
</country>
<region><li>Grand Est</li>
<li>Lorraine (région)</li>
<li>Île-de-France</li>
</region>
<settlement><li>Orsay</li>
<li>Vandœuvre-lès-Nancy</li>
</settlement>
</list>
<tree><country name="France"><region name="Île-de-France"><name sortKey="Bonneau Maynard, Helene" sort="Bonneau Maynard, Helene" uniqKey="Bonneau Maynard H" first="Hélène" last="Bonneau-Maynard">Hélène Bonneau-Maynard</name>
</region>
<name sortKey="Bonneau Maynard, Helene" sort="Bonneau Maynard, Helene" uniqKey="Bonneau Maynard H" first="Hélène" last="Bonneau-Maynard">Hélène Bonneau-Maynard</name>
<name sortKey="Denis, Alexandre" sort="Denis, Alexandre" uniqKey="Denis A" first="Alexandre" last="Denis">Alexandre Denis</name>
<name sortKey="Denis, Alexandre" sort="Denis, Alexandre" uniqKey="Denis A" first="Alexandre" last="Denis">Alexandre Denis</name>
<name sortKey="Quignard, Matthieu" sort="Quignard, Matthieu" uniqKey="Quignard M" first="Matthieu" last="Quignard">Matthieu Quignard</name>
<name sortKey="Quignard, Matthieu" sort="Quignard, Matthieu" uniqKey="Quignard M" first="Matthieu" last="Quignard">Matthieu Quignard</name>
</country>
</tree>
</affiliations>
</record>
Pour manipuler ce document sous Unix (Dilib)
EXPLOR_STEP=$WICRI_ROOT/Wicri/Lorraine/explor/InforLorV4/Data/Main/Exploration
HfdSelect -h $EXPLOR_STEP/biblio.hfd -nk 003930 | SxmlIndent | more
Ou
HfdSelect -h $EXPLOR_AREA/Data/Main/Exploration/biblio.hfd -nk 003930 | SxmlIndent | more
Pour mettre un lien sur cette page dans le réseau Wicri
{{Explor lien |wiki= Wicri/Lorraine |area= InforLorV4 |flux= Main |étape= Exploration |type= RBID |clé= ISTEX:62FE38FFD0D92679441FEF1AFB36859AC3BC3A98 |texte= MEDIA: a semantically annotated corpus of task oriented dialogs in French }}
This area was generated with Dilib version V0.6.33. |